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CLASSIFICATION  OF  INTERMITTENT  DEPENDENT  OBSERVATIONS 


P.  A.  Jacobs  and  D.  P.  Gaver 


1.  THE  PROBLEM 

Consider  the  following  classification  problem.  Suppose  there  are  J  items 
(e.g.,  diseases)  each  of  which  has  a  characteristic  Signature  w^’ch  varies  in 
time;  the  Signature  of  Item  i  is 

Y/(f)  =  02'  +  X/(f)  i  =  1,2,...,/  f  =  0,1,2 .  (1.1) 

For  the  moment  {X,(/)}  is  an  unspecified  multivariate  (or  univariate) 
stochastic  process,  but  one  that  stays  near  0j  in  finite  time  and  has  some 
stationary  or  steady-state  behavior.  In  many  cases,  paths  of  X,  (f)  will  appear 
somewhat  "continuous,"  so  successive  X,(f)'s  are  not  well-modeled  as  iid 
random  variables.  One  could  think  of  Y;(f)  as  physical  indices  characteristic 
of  a  particular  disease,  e.g.,  blood  pressure,  heart-beat  pattern,  cholesterol 
levels.  Examples  from  equipment  reliability  are  also  of  interest;  here  physical 
indices  might  be  vibration,  variations  in  heat  level,  oil  leakage,  and  even  fuel 
consumption  in  the  case  of  engines. 

In  many  circumstances  Y,(f)  is  only  observable  occasionally,  at  times 
unrelated  to  the  value  of  Y,(f)  but  driven  by  other  forces  such  as  the 
scheduling  of  a  routine  physical  exam  or  system  inspection.  Suppose  that  the 
Signature  and  the  identity  of  the  item  associated  with  the  Signature  are  both 
observed  at  time  t  =  0,  on  such  an  occasion.  Suppose  that,  later  on,  however, 
only  the  Signature  of  an  item  is  observed.  The  first  question  is:  What  is  the 
probability  that,  given  the  Signature  value  observed,  its  originating  item  is 
any  particular  one  of  the  J  candidates? 

In  Gaver  and  Jacobs  [1989],  the  processes  (Xj(t)}  are  assumed  to  be 
univariate  Gaussian  and  a  Bayesian  classification  procedure  is  studied.  In  this 
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paper.  Section  2  assumes  (Xj(t)}  are  multivariate  normal  autoregressive 
processes.  In  Section  3,  {Kj(t)}is  a  univariate  Cauchy  autoregressive  process 
whose  marginal  distribution  has  longer  tails  than  the  Gaussian.  A  Bayesian 
classification  procedure  for  the  Cauchy  data  is  studied.  In  Section  4,  we  study 
the  behavior  of  the  univariate  Cauchy  and  Gaussian  classification  procedures 
when  autoregressive  data  having  the  wrong  marginal  distribution  are 
presented  to  them.  The  results  suggest  that  the  Gaussian  classification 
procedure  is  biased  towards  classifying  a  Signature  produced  at  time  t  as  being 
associated  with  the  same  item  that  produced  the  Signature  at  time  0.  The 
Cauchy  classification  procedure  is  biased  towards  classifying  a  Signature 
produced  at  time  t  as  being  associated  with  a  different  item  than  the  one 
producing  the  Signature  at  time  0.  These  effects  are  strongest  for  small  times 
t.  The  largest  number  of  misclassifications  occur  for  small  times  t  when  the 
Gaussian  classification  procedure  is  presented  with  Cauchy  data  and  a 
different  item  is  associated  with  the  Signature  at  time  t  than  the  item 
associated  with  the  Signature  at  time  0;  in  this  situation  the  Gaussian 
procedure  is  relatively  less  sensitive  to  the  change  in  the  item  associated  with 
the  Signature.  Misclassifications  by  the  Cauchy  classification  procedure  are 
modest  in  comparison  to  this  extreme  case. 

In  summary,  it  is  important  to  realize  that  the  performance  of  a  Bayesian 
classification  procedure  can  be  influenced  by  its  underlying  distributional 
assumptions.  A  classification  procedure  based  on  Gaussian  distributional 
assumptions  can  be  reluctant  to  classify  a  new  observation  coming  from  a 
different  item  as  being  associated  with  a  new  item.  A  classification  procedure 
based  on  Cauchy  distributional  assumptions  can  be  reluctant  to  classify  a  new 
observation  which  comes  from  the  same  item  as  that  being  associated  with 
the  same  item.  Hence,  if  there  is  uncertainty  about  the  underlying 
distribution  of  the  data,  it  might  be  better  to  combine  results  of  several 
classification  procedures  based  on  different  distributional  assumptions. 
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2.  THE  MULTIVARIATE  NORMAL  CASE 


2.1  The  Classification  Question 

Assume  for  illustration  that  the  Signature  of  Item  j  is  multivariate  AR(1): 

Y;(O  =  0/+X/(f)  (2.1) 

where  0y,  Y^f),  and  Xj(t)  are  d-dimensional  column  vectors.  The  process 
jXy(f)j  is  a  vector  AR(1)  process 

Xy(f)  =  AjXj(t  - 1)  +  Ey(f)  (2.2) 

where  Ay  is  a  dxd  matrix  and  |Ey(f)|is  a  sequence  of  d-dimensiona'  column 

vectors  which  are  independent  multivariate  normal  with  mean  0  and 
variance-covariance  matrix  Ay.  The  variance-covariance  matrix  for  Xy(t  + 1) 

is 

ry(t + 1)  =  e|x(i + 1)XT(I + 1)]  =  A,r,(t)A[  +  a,.  (2.3) 

We  will  assume  A y  and  A y  are  such  that  there  is  a  finite  unique  solution  to 
the  equation 

r j  -  A,r/A[  +  A;.  (2.4) 

Assume  X^(0)  has  a  normal  distribution  with  mean  0  and  variance- 

covariance  matrix  Ty.  It  follows  that  |Xy (f )|  is  a  stationary  sequence  with 
mean  0  and  variance-covariance  matrix  Ty- 

The  conditional  distribution  of  X y ( f )  given  X^(0)  =  x  is  multivariate 
normal  with  mean  Ayx  and  variance-covariance  matrix 
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(2.5) 


A;(0=  Sa;A;-(a")T. 

n=0 

Thus,  r.-  =  lim  A  At). 

>  f — >oo  7 

The  conditional  distribution  of  the  actually  observable  Y j(t)  given 
Yj(0)=y(0)  is  multivariate  normal  with  mean  0j+A j(y(0)-8j)  and  variance- 

covariance  matrix  Aj(t). 

Operational  Scenario:  There  are,  potentially,  }  items.  Let  C(t)  be  the  identity 
of  the  item  whose  Signature  is  observed  at  time  t.  Put  pj(t)  =  P(C(t)=j}. 
Assume  that  it  is  known  that  the  Signature  observed  at  time  0  comes  from 
Item  i;  that  is,  C(0)  =  i  and  Y(0)  =  Yj(0)  =  y(0).  If  it  has  been  a  long  time  since  a 
Signature  from  item  i  has  been  observed,  it  is  reasonable  to  suppose  that 

P{C(0)  =  i,Y(0)  =  y(0)} 

=  Pi(O)((2'0‘<|r,'|)~O  5exp|--l(y(0)  -  e,)Tr“1(y(0)  -  8,))  (2.6) 

the  long-run  or  steady-state  distribution.  Further, 

P{C(I)  =  i,  Y(<)  =  y(0|C(0)  =  i,Y(0)  -  y(0)} 

=  PiO)  exp{-.i(y(/)-m,(/))TS,(/)-1(y(()-m,(/))}  (2.7) 

where 

m/(l)  =  8,  +  A'(y(0)-e,-)  (28) 

and 

HO  =  A,(t).  (29) 
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For  j  *  i,  we  will  assume  the  conditional  distribution  of  Y(t)  given  C(0)  =  i, 
Y(0)  =  y(0),  C(t)  =  j  is  multivariate  normal  with  mean  mj(t)  -  0j  and  variance- 
covariance  matrix  T,  =  £j(t),  since  it  is  still  a  long  time  since  a  Signature  from 
Item  j  is  observed. 

It  now  follows  that 

p{Y(f)edy(0|C(0)  =  i,Y(0)  =  y(0)} 


Thus,  the  posterior  probability  of  the  identity  of  the  item  associated  with 
the  Signature  is 

P{C(t)  =  /|C(0)  =  i/Y(0)  =  y(0),  Y(t)  =  y(f)}  = 

Pji 0|£;(0|  °’5exp{-^(y(f)-m;(0)T£/(fr1(y(0-m;(0)| 


(OlMOl  °'5 exp{-^(y(0 -  mk(0)T Zk(0  \y(0 - mjt(f))J 


i-i 


•  (2-11) 


2.2  The  Probability  of  an  Incorrect  Classification 

In  this  section  we  assume  that  the  item  that  is  associated  with  the 
Signature  at  time  t  given  the  last  complete  observation  at  time  0  will  be 
estimated  to  be  that  one  which  maximizes  the  posterior  probability  (2.11). 

For  a  simple  illustration  we  will  suppose  that  there  are  only  J=2  possible 
items  with  known  parameters  0i  and  02- 

Given  Y(0)=y(0),  C(0)=1,  and  C(t)  =  1,  the  conditional  distribution  of  Y(t)  is 
multivariate  normal  with  mean 
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">l(O"01  +A'(y(0)-81) 


(2.12) 


and  variance-covariance  matrix 


£i(0  -  X(Ai )  AiAi- 


(2.13) 


Let  the  matrices  Hj(t)  and  H;  be  such  that 


H,(l)H,.(i)T  =  £,(() 


(2.14) 


H,H,T  =  r, 


(2.15) 


It  follows  that 


Y(0-m1(f)+  H|(f)U 


(2.16) 


where  U  is  a  d-dimensional  column  vector  each  of  whose  components  are 
independent  standard  normal  random  variables;  the  notation  =  means  equal 
la  dislribudo.'i.  Thus,  given  Y(0)  =  y(0>,  C(0)  =  1,  C(t)  =  1.. 


( Y(()  -  m, (f))T£i(0“1(V(t)  -  mT(t))-U'  U 


(2.17) 


(Y(()  -  m2(())f  E2(!)  1(Y(')-n'2(')) 


-(n'l(')-n>2(')+lll(')|J)Ti;2(I)  1(m,(l)-m2(l)  +  H](»)U)  (2.18) 


where  m2(t)  =  02  and  Z2U)  =  T2.  Thus,  the  probability  of  a  misclassification  is 
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pjclassify  the  item  as  2|C(0i  =  1,Y(0)  =  y(0),C(f)  =  l} 

=  P|^2(')|^2(0r  "  S  t*xp<[ -i(m ,(()  -  m2(f )+  Hi(f)*J)TE2(<r’(m,(f)  -  m2<0  +  H,(t)U)} 

>;>,(f)iEi(ora5«p{~urujj 

1 

=  /4^(lyj‘j|2  >oxp|-iu7  U  +  i(a(()+H,(()lJ)Tn1(a<0+H,(l)lJ)}  (2  19) 
where 

a(/)  =  01+Ai(y(O)-01)-02.  (2.20) 

Example:  Assume  A,  =  A,  A,  =  A  for  i  =  1,  2,  and  pi(t)  =  p2(t);  then 

pjwrong  classification  |C(0)  =  1,  Y(0)  =  y(0),C(f)  =  l} 

1 

=  2  >exp|-iurU  +  ^(a(f)+H(f)U)Tr-1(a(f)  +  H(t)U)J.  (2.21) 

where 

t- 1  T 

A(0=  5>‘a(a1)  s  I1(|)11(0T; 

k=0 

T  =  lim  A(f)  =  Ml7  is  the  solution  to  the  equation 

/— >oo 

r  =  ArA7  +  A; 

and 

a(/)  =  e,  +  A'(y(0)-e,)-®2 

Note  that  as  t— >°° 
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P | wrong  classification  |C(0)  =  1,  Y(0)  =  y(0),C(f)  =  lj 

->  pji  >exp|-^u7'u  +  ^-(0]  -e2  hiiu)1  r-1^  -e2  +  nu)|| 

=  rji  >exp|t(e, -e2)T(iiii7yl(e1  -e2)+(e,  -e2)T(HT)',u  jj 
=  f>!^i(e1-o2)T(iiH1')''(e1-e2)>(ei-o2)r(uT)"1uL  (2.22) 


A  Simulation 

Table  1  gives  the  results  of  a  simulation  experiment  for  the  case  0]  =  (1,1)T 


0.1  0.9 
0.4  0.5 


and 


In  this  case, 


0 

1 


5.4 

3.8 

_  3.8 

4.5  _ 

Figure  1  shows  contours  from  a  bivariate  normal  distribution  having  mean 
0i  and  variance-covariance  matrix  T. 

In  each  replication  two  independent  vector  random  variables  are 
generated;  one  is  V ( 0 )  which  has  a  normal  distribution  with  mean  0]  and 
variance-covariance  matrix  T;  the  other  is  U,  whose  components  are  two 
independent  standard  normal  random  variables.  For  each  time  t  =  1,2,  40, 

Y(t)  is  calculated  as 
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Y(t)  =  m(t)  +  H(t)U. 


(2.23) 


with  m(t)  =  0i  +  A*  (y(0)  -  0i);  Y(t)  has  the  same  distribution  as  a  Signature 
from  Item  1  when  the  Signature  at  time  0  is  also  from  Item  1.  There  are  1000 
replications.  Table  1  presents  the  fraction  of  replications  for  which  the 
incorrect  classification  is  made  of  Item  2  being  the  one  producing  the 
Signature  at  time  t;  that  is  those  replications  for  which 


A(0| 


>exp{^(y(0-fl2)Tr  1(y(0-O2)-^(y(0-Oi)TA(t)  1(y(f)-o1)|. 


(2.24) 


Note  that  the  fractions  are  not  independent  since  common  random  numbers 
are  used. 

The  contours  of  the  distribution  in  Figure  1  suggest  that  it  is  more  likdy 
to  make  a  misclassification  if  02  =  (2,2)T  than  if  02  =  (-2,2)T;  the  fractions  in 
Table  A  support  this.  The  fractions  in  Table  A  also  suggest  that  the  probability 
of  misclassification  is  an  increasing  function  of  t.  This  observation  is 
supported  by  the  fact  that  the  variances  of  the  components  of  Y(t)  increase  as  t 
increases. 


TABLE  A.  FRACTION  OF  MISCLASSIFICATION 

0i  =  (1,1)T 


Time: 

1 

2 

3 

4 

5 

10 

20 

30 

40 

02  =  <2,2F 

0.10 

0.13 

0.16 

0.20 

0.21 

0.30 

0.39 

0.41 

0.41 

02  =  (-2,2)T 

0.04 

0.06 

0.06 

0.07 

0.07 

0.09 

0.09 

0.09 

0.09 
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3.  CAUCHY  UNIVARIATE  MODEL 

In  this  section  we  consider  Bayesian  classification  for  a  time  series  model 
having  marginal  distributions  with  a  longer  tail  than  the  Gaussian 
distribution. 

We  assume  that 

Yj(t)  =  0i  +  Xi(t) 

with 

Xi(t)  =  PiXi(t-l)  +  £i(t) 

where  I  p*  I  <1;  {£j(t)}  are  independent  sequences  of  independent  identically 
distributed  Cauchy  random  variables  with  location  parameter  0  and 
precisions  j(l  -|p,|)ay  j  ;  and  Xj(0)  has  a  Cauchy  distribution  with  parameters 
0  and  cxj05  Under  these  assumptions  {Xj(t);  t  =  0,  1,  2,  ...)  is  a  stationary 

sequence  of  random  variables  with  marginal  Cauchy  distribution  having 

,  -0.5 

parameters  0  and  a, 

It  follows  that 

P{Yl(0)edy(0)/Yi(t)edyI(f)} 

-i  r  2  _-i-i 

=  -«/  af+(y(Q)-  °if  (ai(1-Mt))  +(y(O-0,-p'(y(o)-0,)) 

(3.1) 

Let  C(t)  denote  the  identity  of  the  item  associated  with  the  Signature  at 
time  t  and  put  P{C(t)=i)=p,(t);  then 

P{y(l)  €  dy(t),C(t)  =  i|C(0)  =  i.V(O)  =  y(0)} 

1/2  2_  1 

=  Pi  (0-  «i{ 1  - 1  Pit )  ( «i(i  - 1  Pit ))  +  (y(0  -  e,  -  p\  (y(o)  -  0j )) 
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=  p/(0-«i(0f«i(02+(y(0-w/(0)2l  •  (3-2) 

n 

Thus, 

|  I  ■  2  ^ 

p{r(t)  e  dy{t)\C(0)  =  i, Y(0)  =  y(0)}  =  £p;-(0“«/(0  a/({)2  +  (y (0 "  m/(0) 

H  *  L 

where  cxj(t)  and  mj(t)  are  defined  in  (3.2)  and  it  is  natural  to  define 
mj(t)  =  6j  and  aj{t)  =  ay  for  j  *  i.  Hence,  given  item  i  is  associated  with  the 

Signature  at  time  0,  the  posterior  probability  that  item  j  is  associated  with  the 
Signature  observed  at  time  t  is 

p{C(f)  =  j\C(0)  =  i,Y(  0)  =  y(0),Y(f)  =  y(0}  (3-3) 

r  2  r1 

=  Pj{t)aj(t^a}(t)2  +  (y(t)  -  my(0) 

x|Zp^(fK(^)[ajt(02+(y( 0~mit(f))2]  j  ■ 

3.2  The  Probability  of  Making  an  Incorrect  Classification 

In  this  section  we  assume  that  the  item  associated  with  the  Signature  at 
time  t  given  the  last  complete  observation  at  time  0  is  estimated  to  be  that  one 
which  maximizes  the  posterior  probability  (3.3).  For  simplicity  we  will 
suppose  there  are  J=2  possible  items  with  known  parameters  0]  and  02- 
First 

p{r(l)  e  dy(f)|V(0)  =  y(0),C(0)  =  l.C(f)  =  /} 

=  (a>(0)2+(yW-"</(0)2  l3  4> 

where 
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«l(0  =  «l  1-|Plf}«2(0  =  «2  (3-5) 

mi(f)  =  0i+pi(y(O)-fli);m2(O=  »2-  (3-6) 

Note  that  given  Y(0)  =  y(0),  C(0)  =  1  and  C(t)  =  1, 

n<)-[ei+pi(y(o)-ei)]+(i-|pi|')«iW  =  mi(f)+a,(()w 

where  W  is  a  Cauchy  random  variable  with  location  parameter  0  and 
precision  1.  Hence,  the  probability  of  making  the  incorrect  classification  of 
estimating  Item  2  as  being  associated  with  the  Signature  at  time  t  given  Item  1 
is  responsible  for  Signatures  at  time  0  and  time  t  and  Y(0)  =  y(0)  is 
PjClassify  as  Item  2|C(0)  =  l,y(0)  =  y(0 ),C(t)  =  l} 

=  > ai(t)[“i(<)2  +(Y(f)~ mi(<))2]~1|c(°)  =  U(0)  =  y(0), c(t)  =  lj 

=  P{^j^y[«2  +  («l(0  +  «l(0WV  -  92)2Y  >  [«!(02  +(«i(t)W)2]~1|C(0)  =  1,Y(0)  =  y(0),C(t)  =  lj 

p~(t)  [«2 +(ml(0  +  «l(0w-«2)2] 

_  dJ  "Zy!  t\  _ _ 1  <-vn\  _  i  wrv>  -  <~n  \  -  i  L 


l|C(0)  =  l,Y(0)=y(0),C(f)=l 


Note  that  as  t->0,  oti (t) — >0,  and  mi(t)-»y(0).  Hence,  the  conditional  probability 
of  a  wrong  classification  tends  to 


P  0> 


«2+(^°)~6*2)2 


1  +  W‘ 


As  t— »«,  ai(t)->ai,  mi(t)— >0]  and  the  conditional  probability  of  a  wrong 
classification  tends  to 
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*  —  I* 


p{  piH  “W1  +  w2)  >  [a^  +  (aiW  +  01 "  02)2]}'  (39) 

If  a2  =  ai  =  a  and  p2(°°)  =  pi(°°),  then  as  t-4°<> 

Pjincorrect  classification!  Y(0)  =  y(0),C(0)  =  l,C(f)  =  l} 


=PH^I1  (3,0) 

which  increases  as  a  increases  and  decreases  as  1 9i  —  02 1  increases. 
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4.  ARE  BAYESIAN  CLASSIFICATION  PROCEDURES  ROBUST? 

In  this  section  the  robustness  of  the  univariate  Cauchy  and  Gaussian 
classification  procedures  against  misspecification  of  the  form  of  the  marginal 
distribution  will  be  studied. 

4.1  Gaussian  Data. 

In  this  subsection  we  assume  that  the  Signatures  of  the  Items  form 
Gaussian  time  series.  In  particular  we  assume  that 

Yi(t)  =  0i  +  Xj(t)  (4.1) 

with 

Xi(t+1)  =  piXi(t)  +  £j(t)  (4.2) 

where  (£j(t)}  are  independent  identically  distributed  normal  random  variables 
with  mean  0  and  variance  of  and  !  pi  |  <1.  The  independent  random  variable 
X;(0)  has  a  normal  distribution  with  mean  0  and  variance 
=  af  /(l-pf)-  Thus  {Xj(t),t>0}  is  a  stationary  sequence  of  normal 
random  variables  with  mean  0  and  variance  aj(°°)2.  Let  C(t)  be  the  identity  of 
the  Item  associated  with  the  Signature  at  time  t. 

As  was  shown  in  Gaver  and  Jacobs  (1989),  the  conditional  distribution  of 
Y(t)  given  Y(0)=y(0),  C(0)=i,  C(t)=i  is  normal  with  mean 

nii(t)  =  0t  +  (y(0)  -  di)p\  (4.3) 

and  standard  deviation 

°i(t)  ~  ~ P2t  ■  (4-4) 
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For  simplicity  we  will  assume  P{C(t)  =  i}  =  pj(t)  =  p(t)  and  there  are  2  Items 
with  parameters  0i  and  82;  thus,  p(t)  =  ^ 

Suppose  the  Cauchy  procedure  is  used  to  estimate  the  identity  of  the  Item 
associated  with  the  Signature  at  time  t;  that  is,  the  Item  which  maximizes  the 
posterior  probability  (3.3)  is  the  estimate  of  the  Item  associated  with  the 
Signature.  Hence,  the  probability  of  an  incorrect  classification  is 

pjClassify  as  Item  2|C(0)  =  1,  Y(0)  =  y(0),C(t)  =  l] 


=  + °^)z -  02)2]  >  [«i(02  +  (01  (0Z)2] 

' 

2 

=  P'  a2(t)ai(t)[o:2(f)2+K(f)+<Ti(t)Z-02)2]  >  1  + 


=  P  «2«i (l  -  |Pl |f  )[«2(f )2  +  (^1  (0  +  cr, (f)Z  -  »z)2 ]  ’>  1+ 


where  Z  is  a  standard  normal  random  variable. 

Note  that  as  t— >0 

p{Classify  as  Item  2|C(0)  =  1,Y(0)  =  y(0),C(f)  =  l} 

r  -i2  It 

=  ?•  a2a1fl-|p1fYa|+(m1(f)  +  <T1(f)Z-02)2l  >  1  +  -  —  2Z2 

'  L  J  L  «i  J  p,!')2 
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=  p 


«2«1  (]  -  IPiI'  )  «2  +  (”»l(0  +  W  ~  6lY 


r> 

i+ 

CTi(°o) 

2  1  +  |Pl|f  v2 

-1 

j 

_ 

.  «1  . 

i-N'  J 

=  P 


a1a2(l-lPlf)[a|  +  (ml(t)  +  ^l(<)z-^)2  >(1-lPl|f^(1-hl‘)+  (l+|Plf) 


=  P 


al+{ml{t)+oi(t)z-62Y 


<  ai«2 


(i-N‘)+  ^  (1+W')Z 


L  «i  J 


«1«2 


tt2  +(y(°)_p2) 


< 

CTi(oo) 

2 

2Z2 

.  «1  . 

-4- 


[2a20rl(oo)": 


<xl+(y(o)-oiY 


<Z 1 


(4.6) 


Thus,  the  conditional  probability  of  an  incorrect  classification  does  not  tend  to 
0  as  t-»o°  as  it  would  if  the  correct  model  were  used;  see  Gaver  and  Jacobs 
(1989)  (3.6). 

Note  that  as  t— »«> 

PjClassify  as  Item  2|C(0)  =  1,Y(0)  =  y(0),C(f)  =  l} 


=  P 


«2«l 


a|  +(0]  +  (J1(oo)Z-62)2j 


i+(fiWz 

l  «l 


-l 


(4.7) 


If  ai  =  a2  =  1,  then  the  above  equals 


=  p{(ff,Kz)2  >  (<r,(-)z  +  e,  -  #2)2} 


=  P\ZL  > 


z  + 


(01-02) 
<*\  H 


z> 


101-0211 
2oi (°°)  J 


(4.8) 


which  is  the  same  as  if  the  correct  model  had  been  used  to  make  the  decision; 
see  (3.9)  of  Gaver  and  Jacobs  (1989). 

Now  we  consider  the  case  in  which  a  different  Item  is  associated  with  the 
Signature  at  time  t  than  the  one  associated  with  the  Signature  at  time  0.  Once 
again  for  simplicity  we  assume  01  =  02  =  o,  pi  =  P2  =  p  with  |  p  1  <  1  and  for  the 
Cauchy  model  oti  =  012  =  a.  Let  o(°°)  =  0/  yj  1-p2  and  a(t)  =  a(°°W  l-p2t.  We 
will  assume  Item  1  is  associated  with  the  Signature  at  time  0  and  Item  2  is 
associated  with  the  Signature  at  time  t. 

For  the  Gaussian  classification  procedure  of  Gaver  and  Jacobs  (1989),  the 
probability  of  an  incorrect  classification  is 
P{classify  as  1 1  Y(0)  =  y(0),  C(0)  =  1,  C(t)  =  2) 

=P{(l-p2t)  exp{~2  [62  +  o(°°  )Z-(0i+  (yfOMl^p1)]  /g(°°)  (1-p21)} 

>exp{~2  (02+o(°°  )Z-02)  / o(°o)  }} 

=V{-\  In  (l-p2t)-|f02  +  0(00  )Z-(0]+  (y(O)-0i)p‘)]2/{o(oo)2(l-p2t)} 

>~2  (o(°°  )Z)  /o(oo)  } 

=P{  (l-p2‘)o(oc)2  In  (l-p2‘)  +[62  +  0(00  )Z-(0,+  (y(O)-0,)p‘)]2 

<(l-p2‘)  (o(oo)Z)2}  (4.9) 
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where  Z  is  a  standard  normal  random  variable.  As  t— >0 


P{classify  as  1 1  Y(0)  =  y(0),  C(0)  =  1,  C(t)  =  2}  ->  P 


62-  y(Q)f 

a(°°)  J 


=  0; 


(4.10) 


that  is,  if  the  Item  associated  with  the  Signature  at  time  t  is  different  than  the 
one  associated  with  the  Signature  at  time  0,  then  as  t— >0,  the  probability  of  an 
incorrect  classification  using  the  Gaussian  procedure  on  Gaussian  data  tends 
to  0. 

As  t-»«>,  the  probability  of  an  incorrect  classification, 

Pfclassify  as  1 1  Y(0)  =  y(0),  C(0)  =  1,  C(t)  =  2) 


->P 


e2  -  e  a2 

O(oo) 


=  PZ< 


92  ~  6i I  ] 

2o(°o)  [■ 


(4.11) 


Suppose  now  the  Cauchy  classification  procedure  is  used  on  the  Gaussian 
data  with  Item  2  associated  with  the  Signature  at  time  t  and  Item  1  associated 
with  the  Signature  at  time  0.  The  probability  of  an  incorrect  classification 
Pfclassify  as  1 1  Y(0)  =  y(0),  C(0)  =  1,  C(t)  =  2} 


2  2 

=P{a(l- 1  p  |  *)[[a(l- 1  p  |  t)]2+[02+o(°o)Z-[0i+(y(O)-0])p1]]  ]  >a(a2+(02+a(~>)Z-02)  ]  } 

=P((1- 1  p  1 0)  [a2  +  (a(°o)Z)2]>[(a(l- 1  p  I ())2  +  [02  +  a(°o  )Z  -  [9,  +  (yfOl-e^p']]2]).  (4.12) 
As  t — >0 

P{classify  as  1 1  Y(0)  =  y(0),  C(0)  -  1,  C(t)  =  2) 

— >P{O>[02  +  a(oo)Z  ~  y(0)]2)  =  0;  (4. 13) 
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that  is,  as  t— >0,  the  probability  of  a  correct  classification  for  the  Cauchy 
procedure  tends  to  1  for  the  case  in  which  the  Item  associated  with  the 
Signature  at  time  t  is  different  from  the  one  associated  with  the  Signature  at 
time  0,  even  though  the  data  are  Gaussian. 

As  t  — >  °° 

Pfclassify  as  1 1  Y(0)  =  y(0),  C(0)  =  1,  C(t)  =  2} 


Hence,  as  t— »<*>  the  probability  of  an  incorrect  identification  tends  to  the  same 
normal  tail  probability  for  both  the  Cauchy  and  Gaussian  classification 
procedures. 

Thus,  for  the  two  limiting  cases  t-*0  and  t->°°,  both  the  Cauchy  and 
Gaussian  procedures  have  the  same  misclassification  probabilities  for  the 
scenario  in  which  the  Item  associated  with  the  Signature  at  time  t  is  different 
than  the  one  associated  with  the  Signature  at  time  0  Note  that  these  are 
theoretical  limiting  results  with  all  parameters  known. 

To  investigate  further  the  behavior  of  the  two  classification  procedures 
on  Gaussian  data  when  Item  1  is  associated  with  the  Signature  at  time  0  and 
Item  2  is  associated  with  the  Signature  at  time  t,  let 

gt(y(0),Z)  =  (ez+oMZ-tei+fyfOl-eOpM)2 

The  conditional  probability  of  an  incorrect  classification  by  the  Gaussian 
procedure  is  from  (4.9) 
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Pjciassify  as  Item  l|V (0)  =  y(0),C(0)  =  l,C(f)  =  2} 

=P{(l-pa)o(oo)2  In(l-p2t)  <  (1-P2t)  (o(oo)Z)2  -gt(y(0)/Z)) 

>P((l-p2‘)o(oo)2  in(i_p2t)  <  (1- 1  p  1 1)  (o(oo)Z)2  -gt(y(0),Z)} 

>P(-a2(l- 1  p  1 0  I  p  |  ‘  <  (1- 1  p  I  *)  (a(oo)Z)2  -gt(y(0),Z)} 
for  t  sufficiently  close  to  0.  From  (4.12)  it  follows  that  the  conditional 
probability  of  misclassification  for  the  Cauchy  procedure  is 

P{(1- 1  p  I  ‘)(c(oo)Z)2  -gt(y(0),Z)  >  a2(l- 1  p  1 0  [1- !  p  I M]}. 

Hence  for  t  sufficiently  small,  the  incorrect  Cauchy  procedure  will  tend  to 
have  fewer  misclassifications  than  the  Gaussian  procedure  applied  to 
Gaussian  data  in  the  scenario  in  which  different  Items  are  producing  the 
Signatures  at  time  0  and  t. 

4.2  Cauchy  Data 

In  this  subsection  we  assume  the  Signatures  form  time  series  with 
Cauchy  marginal  distributions  as  in  Section  3.  In  particular,  we  assume  that 

Yj(t)  =  6i  +  Xi(t)  (4.15) 

with 

Xj(t+1)  =  piXj(t)  +  £j(t)  (4.16) 

where  {£,(t)}  are  independent  identically  distributed  Cauchy  random  variables 

with  location  0  and  precision  [(1- 1  pj  I  )a;]-°-5  with  |  p*  |  <1.  The  independent 

random  variable  Xj(0)  has  a  Cauchy  distribution  with  location  0  and  precision 
-1/2 

otj  .  Under  these  assumptions  (Xj(t)}  is  a  stationary  sequence  of  random 

-1  /2 

variables  with  marginal  Cauchy  distribution  having  parameters  0  and  a ; 

Further,  the  conditional  distribution  of  Y(t)  given  Y(0)  =  y(0),  C(0)  =  i,  C(t)  =  i 
is  Cauchy  with  location  parameter 
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mi(t)  =  04  +  p‘(y(0)  -  0;) 


(4.17) 


and  precision  parameter  ^(t)-1/2  with 

ai(t)  =  ai(l-|pi|‘).  (4.18) 

Let  C(t)  be  the  identity  of  the  Item  associated  with  the  Signature  at  time  t. 

For  simplicity  we  will  assume  there  are  two  items  with  parameters  0i  and 
02-  Further  P{C(t)  =  i}  =  p*(t)  h  p(t). 

Suppose  the  Gaussian  procedure  of  Gaver  and  Jacobs  (1989)  is  used  to 
estimate  the  identity  of  the  item  associated  with  the  Signature  at  time  t;  that 
is,  the  item  which  maximizes  (2.12)  of  Gaver  and  Jacobs  (1989)  is  the  estimate 
of  the  Item  associated  with  the  Signature  at  time  t.  Hence,  the  probability  of 
an  incorrect  classification  is 

PjClassify  as  Item  2|C(0)  =  1,  Y(0)  =  y(0),C(t)  =  l} 


=  P 


o2(°°) 


exp 


(no- e2j 

<J2(00) 


i2 


[ >  <*i(0 


exp 


no-™i(o 

<*i(0 


C(0)  =  1, 

v(o)  =  y(o), 

c(0  =  i 


(4.19) 


where 


Wi(f)  =  01  +pJ(y(O)-0!) 

(4.20) 

<T1(f)  =  01(°o)^l-p12' 

(4.21) 

OlH  =  /  y(l  -P!2);a2H  =  <y2  /y(l -p|) 

(4.22) 
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and  Oj,  i  =  1,  2  are  the  assumed  standard  deviations  of  the  normal 
distributions.  We  will  assume  G}  =  02  =  c;  pi  =  P2  =  P,  cti  =  a2-  Hence, 

p[Classify  as  Item  2|C(0)  =  1,Y(0)  =  y(0),C(f)  =  l} 


=  P^ 


cr(o°) 


exp 


/m1(t)  +  «(t)W-02x2 

a(o°) 


2V 


f  a{t)W^2 


W)exp\-2{-W) 


(4.23) 


where  W  is  a  Cauchy  random  variable  with  location  parameter  0  and 
precision  1. 

pjClassify  as  Item  2jC(0)  =  1,  Y(0)  =  y(0),C(f)  =  l} 


=  P- 


V- 
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p  >  exp 


2  2 

1  («[HpI']J  2  1  (pl(»(°)-ei)'i~(fl'-t,2)+a(|)w) 

2  (l-p2f)a(«)2  2 


0{°°Y 


=  P 


/ 1  Zu  -  .  ... .  _ 

1  «2 

1-lPl' 

ttt2  1(/(y(o)-0i)  +  (01-02)  +  «(Ow)2 

Jl-p  >exp( 

2(l+|p)f)a(oo 

_  w  +  0 

)2  2  a(oo)2 

(4.24) 


/->  0 


->P 


0  >  exp 


(y(o  )-o2y 


<y{°°Y 


=  0; 


thus,  the  probability  of  misclassification  tends  to  zero  as  t— >0  even  though  the 
incorrect  model  is  being  used;  the  correct  Cauchy  procedure  also  has  a 
probability  of  misclassification  tending  to  zero  as  t->0.  As  t->« 

(’[Classify  as  Item  2|C(0)  -  1,Y(0)  =  y(0),C(f)  =  lj 

i2 


=  P 


f  1  a2W2  1  [6]  -  0?  +  aW  I' 
1  >  exp^--— — - T  +  - - —2 - 

2  o(°°)  2  ff(oo) 
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=  p 


1  >  exp 


5^H?[(9l'e2)2  +  2(e''e2)aVV) 


p|o>I(e1-e2)2+(91  +  e2)aw} 


=T<4>-e2|} 

which  is  the  same  as  (3.10)  the  corresponding  probability  when  the  correct 
Cauchy  procedure  is  used. 

To  further  explore  the  behavior  as  t— >0,  let 

gt(W,y(0))  =  [p*(y(0)-ei>  +  (61-62)  +  a(l- 1  p  |  l)W]2 

and 

B(t)  =  a2(l-|p|‘)W2. 

For  t  small  (4.24)  becomes 

PjClassify  as  Item  2|Y(0)  =  y(0),C(0)  =  l,C(f )  =  lj 

=P{o(~)2(l  +  |  p  1 1)  ln(l-p2t)  +  B(t)  >  (1+  |  p  |  ‘)  gt(W,y(0))} 

<P{-a2 1  p  I  l+B(t)  >  (1  +  |  p  I  t)gt(W,y(0))) 

<P{-a2 1  p  I  l+B(t)  >  gt(W,y(0))) 

=P{a2(l-  I  p  I l)  +B(t)  >  a2+gt(W,y(0))j 

=P{(a(l- 1  p  |  ‘))2  +  (1- 1  p  I ')  B(t)  >  (1- 1  p  |  ‘)  [a2+gt(W,y(0))]} 
=P(a(a2+gt(W/y(0))]  >  a(l-|  p  |  ‘)  [(a(l- 1  p  I  “)]  +  (a(l- 1  p  I  *)W)  ]  } 
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which  is  the  conditional  probability  of  misclassification  for  the  Cauchy 
procedure  on  Cauchy  data.  Hence  for  small  t,  the  incorrect  Gaussian 
procedure  will  tend  to  have  fewer  misclassifications  than  the  correct  Cauchy 
procedure  for  the  scenario  in  which  the  same  item  is  associated  with  the 
Signature  at  both  times. 

Now  we  consider  the  case  in  which  the  Item  associated  with  the 
Signature  at  time  t  is  different  than  the  one  associated  with  the  Signature  at 

time  0.  Once  again  for  simplicity  we  assume  ai  =  a  2  =  a,  pi  =  p2  =  P  with 

.  ,  0.5 

Ip  |<1  and  for  the  Gaussian  model  oi  =c 2  =  0.  Let  g(°°)  =  o/(l-p2)  and 

0.5 

o(t)  =  o(°°)  (1— p2t)  .  We  will  assume  Item  1  is  associated  with  the  Signature 
at  time  0  and  Item  2  is  associated  with  the  Signature  at  time  t. 

For  the  Gaussian  classification  procedure  of  Gaver  and  Jacobs  (1989)  the 
probability  of  an  incorrect  classification  is 
Plclassify  as  1 1  Y(0)  =  y(0),  C(0)  =  1,  C(t)  =  2} 

=P{(l-p2')^3  expl-^  [02  +  aW  -  (0,  +  p‘(y(O)-0i))]7a(»°  )2(l-p2‘)} 

1  2  2 
>exp{-^(02+aW-02)  /a(°°)  )} 

=P{-^  In  (l-p2t)  -  ^  [02  +  aW  -  (0i  +  p‘(y(O)-0i))]  /|o(°°  )  (l-p2t)J 
(aW)  /o(°°  )  } 

=P{o(°°  )2(l-p2‘ )  In  (l-p2t )  +(02  +  aW-(0i  +  p‘(y(O)-0,)))2  (4.26) 

<(aW)2(l-p2')) 

where  W  is  a  standard  Cauchy  random  variable. 

As  t — >0,  the  probability  of  an  incorrect  classification, 

P[classify  as  1 1  Y(0)  =  y(0),  C(0)  =  1,  C(t)  =  2} 
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->P{(02+aW-y(O))  <  0}  =  0. 

Hence  as  t->0,  the  probability  of  an  incorrect  classification  tends  to  0  for  the 
Gaussian  procedure  on  Cauchy  data. 

As  t— ><»,  the  probability  of  an  incorrect  decision 

Plclassify  as  1 1  Y(0)  =  y(0),  C(0)  =  1,  C(t)  =  2}->P{(02  +  aW-Gj)2  <(aW)2} 


0  _ q  2  2 

=P{(W+^1-)  <  W  }  =  P{W> 


I  e2-ei  1 

2a 


}• 


Suppose  now  the  Cauchy  classification  procedure  is  used  on  the  Cauchy 
data  with  Item  2  associated  with  the  Signature  at  time  t  and  Item  1  associated 
with  the  Signature  at  time  0.  The  probability  of  an  incorrect  classification  is 
Plclassify  as  1 1  Y(0)  =  y(0),  C(0)  =  1,  C(t)  =  2} 

=P{a(l- 1  p  I  l)[a2(l-  |p|‘)2  +  [02+aW-(01+p‘(y(O)-0i))]2]"1 

>a[a2  +  (02  +aW-02)  ]  } 

=P{(1- 1  p  I  l)[a2  +  (aW)2] 


>[«2(1- 1  p  h)2  +  [02+aW-(0i+p‘(y(O)-01))J2]}. 


(4.27) 


As  t — >0,  the  probability  of  an  incorrect  classification 

Plclassify  as  1 1  Y(0)  =  y(0),  C(0)  =  1,  C(t)  =  2}  -»  P{O>(02+aW  -  y(0))2}  =  0. 

Thus,  the  probability  of  an  incorrect  identification  using  the  Cauchy 
procedure  tends  to  0  as  t->0  for  the  case  in  which  the  Item  associated  with  the 
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Signature  at  time  t  is  different  from  the  Item  associated  with  the  Signature  at 
time  0. 

As  t-^oo,  the  probability  ot  an  incorrect  identification 

P{classify  as  1 1  Y(0)  =  y(0),  C(0)  =  1,  C(t)  =  2} 


-*p{w2  >  (W  + 


(02-0 1)  2 
2  ~)  }  =  P{W  > 


a 


1 02-01 1. 
2a  ' 


the  same  as  for  the  Gaussian  procedure. 

Hence,  for  the  two  limiting  cases  t-»0  and  t->°°  both  the  Cauchy  and 
Gaussian  procedures  have  the  same  misclassification  probabilities  for  the  case 
in  which  the  Item  associated  with  the  Signature  at  time  t  is  different  than  the 
one  associated  with  the  Signature  at  time  0.  Note  these  are  theoretical 
limiting  results  with  all  parameters  known. 

To  further  explore  the  differences  between  the  Gaussian  and  Cauchy 
procedures  for  the  scenario  of  different  Items  associated  with  Signatures  and 
Cauchy  data,  let 

gt(W,y(0))  =  [aW  +  ©z-Oi-pKyfO)-©!)]2.  (4.28) 

From  (4.26)  for  the  Gaussian  procedure,  the  probability  of  an  incorrect 
classification 

P(classify  as  1 1  Y(0)  =  y(0),  C(0)  =  1,  C(t)  =  2} 

=P{a(oo)2(i-p2t)  in  (i-p2t)  <  («W)2  (l-p2t)  _  gt(W,y(0))). 

For  the  Cauchy  procedure,  the  probability  of  an  incorrect  classification 
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P{classify  as  1 1  Y(0)  =  y(0),  C(0)  =  1,  C(t)  =  2} 


=  P{a2(l- 1  p  I  *)[1- !  p  |  M]  <  (1- 1  p  1 1)  (aW)2  -  g,(W,y(0))} 

=  P{a2(l- Ipl^lph  <  (l-lpl4)  (aW)2 - gt(W,y(0))} 

<  P{o(oo)2(l-p2t)  in  (i_p2t)  <  (l- 1  p  1 1)  (aW)2  -  gt(W,y(0))} 

<P{a(oo)2(l-p2t)  ln(l-p2t)  <  (l-p2t)  (aW)2  -  gt(W,y(0))} 

for  t  sufficiently  small.  Thus,  for  small  t  the  Gaussian  procedure  will  tend  to 

have  more  incorrect  classifications  than  the  Cauchy  procedure  for  the 

scenario  of  Cauchy  data  with  the  Item  associated  with  the  Signature  at  time  t 

being  different  than  the  one  associated  with  the  signature  at  time  0.  This 

effect  is  made  stronger  by  the  fact  that  if  the  Gaussian  procedure  is  used  then 

2  2 
an  estimate  of  o(«> )  will  be  needed.  An  estimate  of  o(°°  )  for  Cauchy  data 

will  tend  to  be  very  large  since  the  Cauchy  distribution  does  not  have  a  finite 

variance.  This  effect  will  be  seen  in  the  simulations  of  the  next  subsection. 

4.3  Results  of  simulation  experiments 

This  subsection  reports  on  results  of  simulation  experiments  to  assess  the 
behavior  of  the  Gaussian  and  Cauchy  classification  procedures  when  they  are 
confronted  with  data  from  the  other  distribution.  For  simplicity  we  assume 
there  are  two  Items.  In  the  first  subsection  the  autoregressive  process 
producing  the  data  is  Gaussian.  In  the  second  subsection  the  autoregressive 
process  producing  the  data  is  Cauchy.  In  both  subsections  classification 
procedures  using  both  the  Cauchy  and  Gaussian  distributional  assumptions 
are  assessed.  In  all  cases  pi  =  P2  =  0.5,  0}  =  1,  82  =  2.  The  simulations  use  the 
LLRANDOM  random  number  generator;  cf.  Lewis  and  Uribe  [1981]. 
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a.  Gaussian  Data 

The  simulation  in  this  subsection  uses  data  from  a  Gaussian 
autoregressive  process.  We  will  assume  the  means  of  the  two  Signatures,  9? 
and  02,  are  known  and  pi  =  P2  =  P  is  also  known.  It  remains  to  assess  values 
for  the  (presumed  known)  scale  parameters  of  the  two  classification 
procedures.  In  particular  what  should  the  scale  parameter  a  =  ai  =  <*2  of  the 
Cauchy  procedure  be  when  it  is  applied  to  Gaussian  data?  To  obtain 
reasonable  values  for  Ot  =  02  =  o  for  the  Gaussian  classification  procedure  and 
a  =  ai  =  <*2  for  the  Cauchy  classification  procedure,  the  following  simulation 
experiment  was  performed.  The  experiment  has  100  replications.  In  each 
replication  100  independent,  standard  normals  are  generated.  For  each 
replication,  the  standard  deviation  of  the  data  is  computed  and  the  maximum 
likelihood  estimate  of  a  is  obtained  numerically  assuming  a  Cauchy  density 
function  of  the  form 

,,  x  1  a 

The  medians  of  the  100  estimates  of  a  and  the  100  standard  deviations  are 
calculated.  The  values  obtained  are  =1.0  a m  =0.607.  Note  that  the 
estimates  of  a  are  using  the  incorrect  model  assumption  of  Cauchy  for  the 
Gaussian  data.  The  value  of  is  used  in  the  Gaussian  procedure  to  classify 
observations.  The  value  of  is  used  in  the  Cauchy  procedure. 

Tables  1  and  2  show  results  for  simulation  experiments  with  500 
replications.  In  each  replication  Y(0)  is  generated  from  a  normal  distribution 
with  mean  0]  and  standard  deviation  c(°o)  =  0/ Vl-p2  with  a=l  and  p  =  0.5. 
For  Table  1  Y(t)  is  generated  from  a  normal  distribution  with  mean 
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m(t)  =  0i  +  p*(Y(O)-0i) 


and  standard  deviation  o(t)  =  o(°°W  l-pl;  namely  the  Signature  observed  at 
time  t  is  from  Item  1.  For  Table  2  Y(t)  is  generated  from  a  normal  distribution 
with  mean  02  and  standard  deviation  o(°°);  namely  the  Signature  at  time  t  is 
from  Item  2. 

In  both  Tables  the  Gaussian  classification  procedure  assumes  6^  =  10  is 
the  correct  standard  deviation.  The  Cauchy  classification  procedure  assumes 
aM  =  0.607  is  the  correct  value  for  a. 

The  values  in  Table  1  suggest  that  when  the  same  Item  is  producing  the 
Signature  at  time  0  and  t,  then  the  Gaussian  procedure  produces  more  correct 
classifications  for  small  time  t.  However,  the  number  of  correct  classifications 
is  the  same  for  both  procedures  for  larger  t. 

The  values  of  Table  2  suggest  that  if  a  different  Item  is  producing  the 
Signature  at  time  t,  then  the  Cauchy  classification  procedure  has  more  correct 
classifications  at  time  t  for  small  t  even  though  the  data  are  Gaussian.  For 
larger  t,  both  procedures  have  the  same  number  of  correct  identifications. 

h.  Cauchy  Data 

In  this  subsection  the  data  arise  from  a  Cauchy  autoregressive  process. 
The  mean  Signatures  of  the  two  Items,  0i  and  02,  are  assumed  known  and  p  = 
pi  =  p2  is  also  assumed  known.  It  remains  to  assess  values  for  the  scale 
parameters  of  the  Gaussian  and  Cauchy  classification  procedures.  In 
particular,  what  should  the  scale  parameter  o  =  ai  =  02  of  the  Gaussian 
procedure  be  when  it  is  applied  to  Cauchy  data? 
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TABLE  1.  GAUSSIAN  DATA 


Item  1  Produces  the  Signature  at  time  0 
Item  1  Produces  the  Signature  at  time  t 


Time 

Fraction  Correct  Identifications 

Number  of  times 

t 

Gaussian  Proc. 

Cauchy  Proc. 

Gaussian  Correct 
/Cauchy 
Incorrect 

Gaussian 

Incorrect/Cauchy 

Correct 

1 

0.77 

0.65 

50 

0 

2 

0.68 

0.67 

5 

0 

5 

0.67 

0.67 

0 

0 

10 

0.70 

0.70 

0 

0 

TABLE  2.  GAUSSIAN  DATA 


Item  1  Produces  the  Signature  at  time  0 
Item  2  Produces  the  Signature  at  time  t 


Time 

Fraction  Correct  Identifications 

Number  of  times 

Gaussian  Proc. 

Cauchy  Proc. 

Gaussian  Correct 
/Cauchy 
Incorrect 

Gaussian 

Incorrect/Cauchy 

Correct 

1 

0.64 

0.71 

0 

38 

2 

0.69 

0.71 

0 

6 

5 

0.64 

0.64 

1 

0 

10 

0.68 

0.68 

0 

0 

To  obtain  reasonable  values  for  o  for  the  Gaussian  classification 
procedure  and  a  for  the  Cauchy  classification  procedure,  the  following 
simulation  experiment  was  performed.  The  experiment  has  100  replications. 
Each  replication  generates  100  standard  Cauchy  random  numbers.  For  each 
replication  the  standard  deviation  of  the  data  is  computed  and  the  maximum 
likelihood  estimate  of  a  is  obtained  numerically.  The  medians  of  the  100 
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estimates  of  a  and  the  130  standard  deviations  are  computed.  The  values 
obtained  are 


aM  =  13.23  and  aM=1.03. 

Note  the  high  value  of  the  standard  deviation. 

Tables  3  and  4  present  results  of  simulation  experiments  in  which  the 
data  are  from  a  Cauchy  autoregressive  process.  All  experiments  have  500 
independent  replications.  For  each  replication  Y(0)  is  generated  from  a 
Cauchy  distribution  with  location  parameter  0i,  and  scale  parameter  1;  that  is. 
Item  1  is  producing  the  Signature  at  time  0.  For  replications  reported  in  Table 
4,  Y(t)  is  generated  from  a  Cauchy  distribution  with  location  parameter  02  and 
scale  parameter  1;  that  is.  Item  2  is  producing  the  Signature  at  time  t.  For 
replications  reported  in  Table  3,  Y(t)  is  generated  from  a  Cauchy  distribution 
having  density  function 

«  \  1  <*(*> 
ux;  "  n  a(t)2  +  (x-m(t))2 

with 

m(t)  =  0]  +  p‘(y(O)-0]) 

and 

a(t)  =  (l-ipf); 

that  is,  Item  1  is  also  producing  the  Signature  at  time  t. 

In  both  Tables  3  and  4,  the  Gaussian  classification  procedure  assumes  a 
standard  deviations  Oi  =  02  =ctm-  The  Cauchy  classification  procedure 
assumes  the  a-parameters  oti  =  012  =  6m  . 

The  results  of  Table  3  indicate  that  for  small  times  t,  if  the  same  Item  is 
producing  the  Signature  at  time  0  and  time  t,  then  the  Gaussian  classification 


31 


procedure  has  more  correct  classifications  even  though  the  data  are  Cauchy. 
For  larger  times  t,  the  number  of  correct  classifications  is  the  same  for  both 
procedures.  On  the  other  hand,  the  results  of  Table  4  indicate  that  if  a 
different  item  is  producing  the  Signature  at  time  t,  then  the  Cauchy 
classification  procedure  has  many  more  correct  classifications  than  the 
Gaussian  procedure  for  small  times  t.  Once  again  the  number  of  correct 
identifications  is  the  same  for  both  procedures  as  t  becomes  larger. 

TABLE  3.  CAUCHY  DATA 


Item  1  Produces  the  Signature  at  time  0 
Item  1  Produces  the  Signature  at  time  t 


Time 

Fraction  Correct  Identifications 

Number  of  times 

Gaussian  Proc. 

Cauchy  Proc. 

Gaussian  Correct 
/Cauchy 
Incorrect 

Gaussian 

Incorrect/Cauchy 

Correct 

1 

0.98 

0.73 

124 

0 

5 

0.72 

0.69 

18 

0 

10 

0.66 

0.66 

0 

1 

TABLE  4.  CAUCHY  DATA 


Item  1  Produces  the  Signature  at  time  0 
Item  2  Produces  the  Signature  at  time  t 


Time 

Fraction  Correct  Identifications 

Number  of  times 

Normal  Proc. 

Cauchy  Proc. 

Normal  Correct 
/Cauchy 
Incorrect 

Normal 

I  ncorrect/Cauchy 
Correct 

1 

0.09 

0.74 

0 

316 

2 

0.14 

0.70 

0 

282 

5 

0.67 

0.61 

0 

29 

10 

0.64 

0.64 

0 

0 
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c.  Summary 

The  differences  in  performance  of  the  two  classification  procedures 
appear  for  small  time  t.  If  the  same  Item  is  producing  Signatures  at  both  0 
and  t,  then  the  Gaussian  classification  procedure  has  more  correct 
classifications  for  small  times  t  for  both  Gaussian  and  Cauchy  data.  If  a 
different  Item  is  producing  the  Signature  at  time  t,  then  the  Cauchy 
classification  procedure  has  more  correct  classifications  for  both  Gaussian  and 
Cauchy  data.  The  effect  is  strongest  if  the  data  are  from  a  Cauchy 
autoregressive  process;  in  this  case  the  Gaussian  procedure  does  very  poorly 
when  different  Items  are  producing  the  Signatures. 

In  summary,  it  is  important  to  realize  that  the  performance  of  a  Bayesian 
classification  procedure  can  be  influenced  by  its  underlying  distributional 
assumptions.  A  classification  procedure  based  on  Gaussian  distributional 
assumptions  can  be  reluctant  to  classify  a  new  observation  coming  from  a 
different  item  as  being  associated  with  a  new  item.  A  classification  procedure 
based  on  Cauchy  distributional  assumptions  can  be  reluctant  to  classify  a  new 
observation  which  comes  from  the  same  item  as  that  being  associated  with 
the  same  item.  Hence,  if  there  is  uncertainty  about  the  underlying 
distribution  of  the  data,  it  might  be  better  to  combine  results  of  several 
classification  procedures  based  on  different  distributional  assumptions. 
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50.  Dr.  John  E.  Rolph . 1 

RAND  Corporation 

1700  Main  St. 

Santa  Monica,  CA  90406 
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51.  Prof.  Linda  V.  Green . 1 

Graduate  School  of  Business 

Columbia  University 
New  York,  NV  10027 

52.  Dr.  David  Burman . 1 

AT&T  Bell  Telephone  Laboratories 

Mountain  Avenue 
Murray  Hill,  NJ  07974 

53.  Dr.  Ed  Coffman . 1 

AT&T  Bell  Telephone  Laboratories 

Mountain  Avenue 
Murray  Hill,  NJ  07974 

54.  Prof.  William  Jewell . 1 

Operations  Research  Department 

University  of  California,  Berkeley 
Berkeley,  CA  94720 

55.  Prof.  D.  C.  Siegmund . 1 

Dept,  of  Statistics,  Sequoia  Hall 

Stanford  University 
Stanford,  CA  94305 

56.  Operations  Research  Center,  Rm  E40-164 . 1 

Massachusetts  Institute  of  Technology 

Attn:  R.  C.  Larson  and  J.  F.  Shapiro 
Cambridge,  MA  02139 

57.  Arthur  P.  Hurter,  Jr . 1 

Professor  and  Chairman 


Dept,  of  Industrial  Engineering  and  Management  Sciences 
Northwestern  University 
Evanston,  IL  60201-9990 


58.  Institute  for  Defense  Analysis . 1 

1800  North  Beauregard 

Alexandria,  VA  22311 

59.  Prof.  J.  W.  Tukey . 1 

Statistics  Dept.,  Fine  Hall 

Princeton  University 
Princeton,  NJ  08540 
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60.  Dr.  Daniel  H.  Wagner . 1 

Station  Square  One 

Paoli,  PA  19301 

61.  Dr.  Colin  Mallows . 1 

AT&T  Bell  Telephone  Laboratories 

Mountain  Avenue 
Murray  Hill,  NJ  07974 

62.  Dr.  D.  Pregibon . 1 

AT&T  Bell  Telephone  Laboratories 

Mountain  Avenue 
Murray  Hill,  NJ  07974 

63.  Dr.  Jon  Kettenring . 1 

Bellcore 

435  South  Street 

Morris  Township,  NJ  07960 

64.  Prof.  David  L.  Wallace . 1 

Statistics  Dept.,  University  of  Chicago 

5734  S.  University  Ave. 

Chicago,  IL  60637 

65.  Dr.  S.  R.  Dalai . 1 

AT&T  Bell  Telephone  Laboratories 

Mountain  Avenue 
Murray  Hill,  NJ  07974 

66.  Dr.  M.  J.  Fischer . 1 

Defense  Communications  Agency 

1860  Wiehle  Avenue 
Reston,  VA  22070 


67.  Dr.  Prabha  Kumar 

Defense  Communications  Agency . 1 

1860  Wiehle  Avenue 
Reston,  VA  22070 

68.  Dr.  B.  Doshi . 1 

AT&T  Bell  Laboratories 

HO  3M-335 
Holmdel,  NJ  07733 
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69.  Dr.  D.  M.  Lucantoni . 1 

AT&T  Bell  Laboratories 

Holdmel,  NJ  07733 

70.  Dr.  V.  Ramaswami . 1 

MRE  2Q-358 


Bell  Communications  Research,  Inc. 

435  South  Street 
Morristown,  NJ  07960 

71.  Prof.  G.  Shantikumar . 1 

The  Management  Science  Group 

School  of  Business  Administration 
University  of  California 
Berkeley,  CA  94720 

72.  Dr.  D.F.  Daley . 1 

Statistic  Dept.  (I.A.S.) 

Australian  National  University 
Canberra,  A.C.T.  2606 
AUSTRALIA 

73.  Dr.  Guy  Fayolle . 1 

I.N.R.I.A. 

Dom  de  Voluceau-Rocquencourt 
78150  Le  Chesnay  Cedex 
FRANCE 


74.  Professor  H.  G.  Daellenbach . 1 

Department  of  Operations  Research 

University  of  Canterbury 
Christchurch,  NEW  ZEALAND 

75.  Koh  Peng  Kong . 1 


OA  Branch,  DSO 
Ministry  of  Defense 
Blk  29  Middlesex  Road 
SINGAPORE  1024 

76.  Professor  Sir  David  Cox . 1 

Nuffield  College 
Oxford,  OXI  INF 
ENGLAND 
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77.  Dr.  A.  J.  Lawrence . 

Dept,  of  Math  Stat.,  Univ.  of  Birmingham 
P.  O.  Box  363 
Birmingham  B1^  2TT 
ENGLAND 

78.  Dr.  F.  P.  Kelly . 1 

Statistics  Laboratory 

16  Mill  Lane 

Cambridge 

ENGLAND 

79.  Dr.  R.  J.  Gibbens . 1 

Statistics  Laboratory 

16  Mill  Lane 

Cambridge 

ENGLAND 

80.  Dr.  John  Copas . 1 

Dept,  of  Math  Stat.,  Univ.  of  Birmingham 

P.  O.  Box  363 
Birmingham  B15  2TT 
ENGLAND 

81.  Dr.  D.  Vere-Jones . 1 

Dept,  of  Math,  Victoria  Univ.  of  Wellington 

P.  O.  Box  196 
Wellington 
NEW  ZEALAND 

82.  Prof.  Guy  Latouche . 1 

University  Libre  Bruxelles 

C.  P.  212,  Blvd.  De  Triomphe 
B-1050  Bruxelles 
BELGIUM 

83.  Naval  Ocean  Systems  Center . 1 

San  Diego,  CA  92152 
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